Can big data and random forests improve avalanche runout estimation compared to simple linear regression?

نویسندگان

چکیده

Accurate prediction of snow avalanche runout-distances in a deterministic sense remains challenge due to the complexity all physical properties involved. Therefore, many locations including Norway, it has been common practice define runout distance using angle from starting point end zone (α-angle). We use large dataset events Switzerland (N = 18,737) acquired optical satellites calculate α-angle for each avalanche. The α-angles our are normally distributed with mean 33° and standard deviation 6.1°, which provides additional understanding insights into distribution. Using feature importance module Random Forest framework, we found most important topographic parameter predicting be average gradient release area β-point. Despite modern machine learning (ML) method, simple linear regression model yield higher performance than ML attempts. This means that is better an operational context.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Random Forests for Big Data

Big Data is one of the major challenges of statistical science and has numerous consequences from algorithmic and theoretical viewpoints. Big Data always involve massive data but they also often include data streams and data heterogeneity. Recently some statistical methods have been adapted to process Big Data, like linear regression models, clustering methods and bootstrapping schemes. Based o...

متن کامل

Estimation of suspended sediment concentration and yield using linear models, random forests and quantile regression forests

For sediment yield estimation, intermittent measurements of suspended sediment concentration (SSC) have to be interpolated to derive a continuous sedigraph. Traditionally, sediment rating curves (SRCs) based on univariate linear regression of discharge and SSC (or the logarithms thereof) are used but alternative approaches (e.g. fuzzy logic, artificial neural networks, etc.) exist. This paper p...

متن کامل

Implementation of Random Forest Algorithm in Order to Use Big Data to Improve Real-Time Traffic Monitoring and Safety

Nowadays the active traffic management is enabled for better performance due to the nature of the real-time large data in transportation system. With the advancement of large data, monitoring and improving the traffic safety transformed into necessity in the form of actively and appropriately. Per-formance efficiency and traffic safety are considered as an im-portant element in measuring the pe...

متن کامل

Strategies to improve neuroreceptor parameter estimation by linear regression analysis.

In an attempt to improve neuroreceptor distribution volume (V) estimates, the authors evaluated three alternative linear methods to Logan graphical analysis (GA): GA using total least squares (TLS), and two multilinear analyses, MA1 and MA2, based on mathematical rearrangement of GA equation and two-tissue compartments, respectively, using simulated and actual PET data of two receptor tracers, ...

متن کامل

Robust linear registration of CT images using random regression forests

Global linear registration is a necessary first step for many different tasks in medical image analysis. Comparing longitudinal studies 1 , cross-modality fusion 2 , and many other applications depend heavily on the success of the automatic registration. The robustness and efficiency of this step is crucial as it affects all subsequent operations. Most common techniques cast the linear registra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Cold Regions Science and Technology

سال: 2023

ISSN: ['0165-232X', '1872-7441']

DOI: https://doi.org/10.1016/j.coldregions.2023.103844